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Abstract 

In this paper we describe the emergence of scale- free degree distributions from statistical mechan- 
ics principles. We define an energy associated to a degree sequence as the logarithm of the number 
of indistinguishable simple networks it is possible to draw given the degree sequence. Keeping fixed 
the total number of nodes and links, we show that the energy of scale-free distribution is much 
higher than the energy associated to the degree sequence of regular random graphs. This results 
unable us to estimate the annealed average of the number of distinguishable simple graphs it is 
possible to draw given a scale-free distribution with structural cutoff. In particular we shaw that 
this number for large networks is strongly suppressed for power -law exponent 7 — > 2. 

PACS numbers: :89.75.Hc,89.75.Fb,89.75.Da 
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Recently a large variety of complex systems, from the Internet to the protein interactions 
in the cell, have been described in terms of the underlining complex networks In 
many cases the topology of these networks is the outcome of a self-organized stochastic 
process since there is no an a priori design of the connections. Nevertheless these structures 
evolve in order to perform some special task. Therefore it is crucial to ask how much 
complex networks are far from optimal performance. To answer this question the optimality 
of the networks has been defined with respect to a variety of different specific prerequisites 
, lOj: i) respect to a specific function , oi a) to their dynamics 
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or 



in) to some topological robustness features j^. Nevertheless, many real networks show some 
universality in their topology [l, 2]. A major universal character of these networks is given 
by their scale-free degree distribution which is a general property of a vast class of complex 
systems jl^. In fact, a large variety of real networks are scale-free with diverging second 
moment of their degree distribution {k"^) while others are finite-scale, with a finite second 
moment (fc^). These two general network distributions have very strong impact also on the 
dynamics defined on these graphs as it has been widely discussed in the literature 11[ . Until 
now the way to explain the appearance of the recently identified scale-free structure is two- 
fold: on one side there are growing network models which assume that scale-free networks 
dynamically emerge from a growing process in which new links are added following the 
preferential attachment rule , on the other side there are equilibrium networks which 

by means of some externally imposed hidden variable present on each node show up a scale- 



free degree distribution [12 



, isi . We note here that because of the so widely observed degree 
distributions of the networks, further attention must be put on their details: i.e. on the 
deviations from random scale-free or finite-scale distributions which makes them specific 
to their function as it has been indicated for example by works on motifs and community 
identification 

In this paper we propose a method which describes the emergence of scale-free networks 
from statistical mechanics principles. We limit ourself to the case in which the embedding 
space, if it exists, plays a limited role in the network wiring and we allow any two nodes 
of the network to be linked. We take a very simple and general assumption: we assume 
that the degree distribution of the network minimize a partition function associated to the 
network. A fundamental role in this partition function is played by the energy associated to 
the degree distribution. We define this energy as the logarithm of the total number A/g of 
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indistinguishable simple networks it is possible to draw by wiring the edges given a certain 
degree distribution. Consequently, the energy is a measure of the redundancy present in the 
space of allowed simple networks given a degree distribution. This space is not an absolute 
abstract space but it is a well known space considered in many applications. Indeed for 
example it is sampled by the widely used randomization algorithm [16], by swapping pairs 
of links without changing the degree distribution. The formulation of our problem assumes 



the aspect of some type of statistical mechanics o: 



pursued also by other authors 
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the networks, a direction which has been 



22l |. However here we don't fix an a priori 
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degree distribution, or a preferential attachment dynamics like in Ref. |17l . 
we take an unsupervised approach in which we derive the more probable distribution of a 
graph of nodes and L links, which minimize a free energy defined in terms of a single 



external parameter z. On the other side the free energy differs from the free energy of |18l.ll9| 
because the partition function is defined directly over the degree distribution. In this work 
we show that scale-free degree distributions with different exponent 7 > 2 minimize the free 
energy of our problem. The energy of such graphs is an extensive quantity which decrease 
with 7 showing that networks with 7 — > 00 which are regular random networks with degree 
distribution infinitevely peacked around their average value, have much lower energy than 
networks with lower 7 and diverging second moment of the connectivity < k"^ >. These 
founding unable us to draw some conclusion on the nature of the space of simple graphs 
associated with given degree sequence. Making the estimation of the average number of 
simple graphs A/sg and showing that in the thermodynamic limit and when a structural 
cutoff is considered this number is strongly suppressed as 7 — ^ 2. 

In order to define our partition function let's consider a network with nodes and 2L 
edges with degree sequence {ki, . . . , /ctv} associated to the degree distribution N^. We define 
the energy E{{Nj:}) associated to the degree distribution of the network as the logarithm of 
the number Afc of indistinguishable simple networks it is possible to draw given the degree 
sequence, i.e. 

E{{Nk}) = logiAfc). (1) 

where a simple network is a graph without tadpoles and double links. The number A/g can 
be expressed as 

A/-c = e^«^^» = JjA;!^^ (2) 
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In fact, for every simple graph associated with the given distribution every permutation of 
the edges departing from each node generates the same set of hnks. These permutations are 
given by Ylk kl^'', and consequently we derive Eq. ([2]). 

On the other side, the number of ways J^in^} which we can distribute 2L edges into 
any degree sequence {ki, . . . , kj^} of distribution {Nk} is given by 

(2L)! 

where we consider unlabeled nodes. Thus we can define an entropy S'dA/'fc}) of each distri- 
bution as 

Once we have defined the energy E{{Nk}) and the entropy ^dA^fc}) associated to a network 
degree distribution, we can define a partition function of the network. 

Proceeding as in standard statistical mechanics, we define a normalized partition function 
Z of the network as the sum over all microstates of the problem, i.e. degree distributions, 
with given energy E{{Nk}) and entropy S{{Nk}) 

Z = ^ y 'e-^^«^^»+^«^^». (5) 
2L ! ^ ^ ^ 

^ ^ {Nk} 

In other words we would like to know which are the more likely distributions {N^} which 
minimize the free energy of the network F({A^fc}) = E{{Nj:}) — zS{{Nj:}). The role of the 
parameter z is to measure a tradeoff between the 'energetic' and the 'entropic' term in the 
definition of the free energy, as well as the temperature T in classical statistical mechanics. 
In equation ([5]) the sum over the {Nk} distributions is extended only to {N^} for which 
the total number of nodes and the total number of links L in the network is fixed, i.e. 



k 

Y,kNk = 2L. (6) 



To enforce these conditions we introduce in ^ the delta functions in the integral form 
u u Jo 27rJo 27r 



(7) 



4 



Performing 



d\ r dv 



i2L)\ J 2Tr J 2tt 



dX f dv 



exp 



-E({iV4) + S{{Nk]) - i\{2L - kNk) - iu{N - J] N^) 



— / — exp 



?A2L - iz/iV + ^ log G'fc( A, 



where 



iXk + iv log(A;!) 



(9) 



Assuming that the sum over all can be approximated by the sum over all = kN^ 
1, 2, . . . oo we get log [Gfc(A, z/)] = exp \i\ + iu/k — \og{k\)~\ and 



/(A, z/) = -i{k)X -i^ + j^^e 



(10) 



where < k >= 2L/N indicates the average degree of the network. By evaluating ([8]) at the 
saddle point, deriving the argument of the exponential respect to A and z/, we obtain 



^ k 

1 _ IjX+iu/k-^logikl) 
AT 2^ 



N ^ k 

k 

and the marginal probability that = kn = £ is given by 

P(L. = i = nk) = }_e-'/^^o,ik^^A^^, 

with 



(11) 



(12) 



ZkiL.i.N) = 1^1 ^exp[NMX,uJ)] (13) 

and MX, z/, i) = t{{k) - e/N)X - iu{l - ^/{kN)) + ^ Y.s^k ^M^>^ + I s - Ts log(sO]- « we 
develop f|T2|) for i <^ L and we use the Stirling approximation for factorials, we get that 
each variable Lk is a Poisson variable with mean < Lk > satisfying 



=< Nk >= k~^~'e'+h^/' 

k 



(14) 



which describe the optimal degree distribution for our problem. If we restrict ourself to 
the networks with finite average degree in the thermodynamic limit, the allowed values of 
z are z < 1. From the expression (HM of the optimal degree distribution, if 2; G (0, 1) the 



|^/|exp|iV/(A..)] (8) 
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FIG. 1: Average degree {k) of the optimal distribution of as a function of v for networks with 
z = 0.2 (solid line) z = 0.3,0.4,0.5,0.6,0.7,0.8 and z = 0.9 (dashed line). 



optimal degree distribution is scale-free with a power-law tail characterized by the exponent 
7 = i + In this case the distribution f[T^ always diverges in zero, thus we necessarily must 
impose that there are no isolated nodes in the network. The parameter z/ 7^ modulates the 
average degree of the graph constituting for z/ > an effective lower cutoff of the distribution 
wherever the upper cutoff K of the degrees is the natural cutoff of the distribution f|T^ . 
A different scenario arises if 2; < 0, when the optimal network (fl^ has a power-law degree 
distribution increasing with the degree k. In this case the Lyapunov functions v and A 
cannot fix the average degree unless one introduces by hand an upper cutoff K in the degree 
of the nodes of the order of magnitude of the average degree {k). We note here that also for 
z G (0, 1) it could be convenient to set by hand a structural cutoff K ~ A^^''^ for 2; > 1/2 in 
order to obtain an uncorrelated network. In the following we will consider only the power-law 
case z G (0, 1). 

From equations fllip we derive for the Lagrangian multipliers A and z/ in z G (0, 1) : 



1 ),4-l 



r(i|.)-r(i..) 

z K z 



F 



' K 



F 



1 ),4-l 



r(i)-r(i,^) 

z z K 



N 

{k) 

1, 



F 



1 u 
~zli 



F 



(15) 



where F(a, h) indicates the incomplete Gamma function. This system of equations is solvable 
provided {k) > 1, as can be seen from Figured] where we plot the average degree of a network 
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FIG. 2: The normalized degree distribution P{k) for z = 0.75 as a function of the average degree 
of the network. Data are shown for (k) = 1.3, 2, 4, 40, 100. 

as a function of u for different values of z. In fact, for z G (0, 1) there are highly connected 
nodes in the network and the degree distribution for large degrees k decays as a power-law 
with an exponent 7 fixed by the value of z. On the other side, for small value of the degrees 
k the degree distribution deviates from the simple power-law and depends strongly on 
the value of the Lagrangian multiplier For low average connectivities, i.e. 

(k) = {k)o < ^ (16) 

the solution of Eqs. f|T5|) involves a negative value of u. Accordingly, low degree nodes are 
more frequent than expected by a simple power-law while for (k) > (A;)o, ly > and the low 
connected nodes are less probable than predicted by a simple power-law. 

In Figure [2] we show the optimal distributions which solve these equations at different 
values of the average degree (k). 

Given the optimal distributions f|T4|) we can calculate the energy of the network as a 
function of z at fixed average degree (k). In Figure [3] we present the energy of the optimal 
graph as a function of z for different average connectivities. 

The energy has a minimum in the limit z — > when the optimal degree distribution is 
infinitely peaked around the average, and thus the graph is a random regular graph. On the 
contrary, in the limit z ^ 1 where the degree distribution has a power-law exponent 7 — >■ 2 
the energy E{{Nk}) is at the maximum. 

From the derivation of our model it is evident the connection with "ball in the box" 
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FIG. 3: Energy of the network as a function of the parameter 2:, for an average degree (fc) = 4, 6, 
in the hmit N ^ 00. 



problems 
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24l |. In our approach the "boxes" map to the degree of the nodes and 



the "balls" map to the edges of the graph. This makes a crucial difference respect with the 



model 



22| . in which the "boxes" map to the nodes of the graph. Although their formal 



similarities this difference make the two model very different in their conclusions. 

We would like to indicate here that if we introduce by hand a structural cutoff K ~ N^^'^ 
we can assume that the networks described in this paper are randomly wired. In this case 
we can evaluate the number JVsg of distinguishable simple graphs is it possible to construct 
given the degree distribution f|T4|) . This number is approximated by 



so oc 



i2L)\\e 



(17) 



In fact the total number of wiring it is possible to draw given 2L edges is given by (2L)!!. 
This number include all type of possible wiring of the edges including the ones which give 
rise to graphs which are not simple. Assuming that the graph is randomly wired, i.e. that 
the probability that a node with ki edges connect to a node with kj edges is a Poisson 
variable with average kikj/{< k > N) the probability 11 that the graph is simple is equal to 



25| 



n = n 1 + 



<k> N 



-kikj<k>N 



1 / <fc^> 

2 V <fc> 



(18) 



Finally in the expression (ITTI) for Msg there is an additional terms which takes into account 
the equivalent wiring of the edges which is given by e''^^''^^'-'^^ The term for scale-free 
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graphs with cutoff K oc N'^l'^ is subleading respect to the energetic term E[{N}S^) which 
dominates for large network sizes A^. 

Consequently the total number of distinguishable simple graphs Msg in an annealed 
approximation, is then a decreasing function of 7 suggesting that in random scale free 
graphs the space of distinguishable simple random graphs is strongly suppressed as 7 — 2. 

In conclusion, the statistical mechanics treatment of complex networks shown in this 
paper is able to put in a similar context, the emergence of scale- free networks and finite-scale 
networks. Scale-free degree distribution correspond to higher energy state of the network 
respect to finite-scale networks. Especially regular random graphs have minimal energy. 
Consequently the large variety of real complex networks in many technological, biological 
and social systems which show a scale-free degree distribution reveal a tendency have a 
degree distribution which maximize the energy. Furthermore we give an estimation of the 
annealed average size of the space of simple graphs Msg for random scale-free networks 
showing that Msg is strongly reduced as 7 ^ 2. 
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